Overview
Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 656 |
| Missing cells | 3123 |
| Missing cells (%) | 20.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 526.4 KiB |
| Average record size in memory | 821.7 B |
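The derived figures in this table can be cross-checked from the raw counts. The sketch below is a minimal check, assuming that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 23
n_observations = 656
missing_cells = 3123
total_size_kib = 526.4

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)

# Assumption: 1 KiB = 1024 bytes, as the "KiB" unit suggests.
avg_record_bytes = round(total_size_kib * 1024 / n_observations, 1)

print(total_cells, missing_pct, avg_record_bytes)
```

Under these assumptions the computed values agree with the reported 20.7% and 821.7 B.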
Variable types
| Text | 6 |
|---|---|
| Numeric | 11 |
| Categorical | 4 |
| Boolean | 1 |
| URL | 1 |
Alerts
| Alert | Type |
|---|---|
| exclude has constant value "True" | Constant |
| column has constant value "http://www.shadowandact.com/?p=23430" | Constant |
| audience score is highly overall correlated with rotten tomatoes | High correlation |
| box office average per cinema is highly overall correlated with domestic gross and 4 other fields | High correlation |
| budget is highly overall correlated with domestic gross and 4 other fields | High correlation |
| domestic gross is highly overall correlated with box office average per cinema and 6 other fields | High correlation |
| foreign gross is highly overall correlated with box office average per cinema and 6 other fields | High correlation |
| genre is highly overall correlated with id | High correlation |
| id is highly overall correlated with genre and 3 other fields | High correlation |
| lead studio is highly overall correlated with id | High correlation |
| number of theatres in opening weekend is highly overall correlated with budget and 5 other fields | High correlation |
| opening weekend is highly overall correlated with box office average per cinema and 6 other fields | High correlation |
| profit is highly overall correlated with box office average per cinema and 5 other fields | High correlation |
| rotten tomatoes is highly overall correlated with audience score | High correlation |
| story is highly overall correlated with id | High correlation |
| worldwide gross is highly overall correlated with box office average per cinema and 6 other fields | High correlation |
| year is highly overall correlated with id | High correlation |
| exclude has 528 (80.5%) missing values | Missing |
| lead studio has 109 (16.6%) missing values | Missing |
| number of theatres in opening weekend has 45 (6.9%) missing values | Missing |
| box office average per cinema has 54 (8.2%) missing values | Missing |
| foreign gross has 55 (8.4%) missing values | Missing |
| budget has 11 (1.7%) missing values | Missing |
| proftitability has 11 (1.7%) missing values | Missing |
| oscar has 640 (97.6%) missing values | Missing |
| bafta has 644 (98.2%) missing values | Missing |
| source has 351 (53.5%) missing values | Missing |
| column has 655 (99.8%) missing values | Missing |
| id is uniformly distributed | Uniform |
| movies_id has unique values | Unique |
| id has unique values | Unique |
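The report does not say which statistic or cutoff produced the "High correlation" alerts. As a rough illustration of how such a flag could be reproduced for two numeric columns, the sketch below computes a Spearman rank correlation and compares it with a threshold. The choice of Spearman, the 0.5 cutoff, and the sample values are all assumptions made for this example, not figures taken from the report.

```python
def ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def pearson(xs, ys):
    """Pearson correlation; returns NaN when either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    if sx == 0 or sy == 0:
        return float("nan")
    return cov / (sx * sy)


def spearman(xs, ys):
    """Spearman correlation is the Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


THRESHOLD = 0.5  # assumed cutoff, not taken from the report

# Illustrative values only; they are not rows from the dataset.
domestic = [10.0, 40.2, 88.9, 222.1, 743.8]
worldwide = [25.0, 75.7, 176.2, 576.4, 2712.9]

rho = spearman(domestic, worldwide)
is_high = abs(rho) >= THRESHOLD
```

Note that id is strictly increasing, so its correlations with genre, year, story, and lead studio may simply reflect the order in which rows were stored rather than a meaningful relationship.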
Reproduction
| Analysis started | 2025-09-15 10:05:52.135734 |
|---|---|
| Analysis finished | 2025-09-15 10:06:35.909159 |
| Duration | 43.77 seconds |
| Software version | ydata-profiling v4.16.1 |
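The reported duration follows directly from the two timestamps. A minimal check using only the standard library, with the timestamps copied from the table above:

```python
from datetime import datetime

started = datetime.fromisoformat("2025-09-15 10:05:52.135734")
finished = datetime.fromisoformat("2025-09-15 10:06:35.909159")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```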
Variables
movies_id
Text
Unique 
| Distinct | 656 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.6 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 656 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | movies-0000 |
|---|---|
| 2nd row | movies-0001 |
| 3rd row | movies-0002 |
| 4th row | movies-0003 |
| 5th row | movies-0004 |
Words
| Value | Count | Frequency (%) |
| movies-0000 | 1 | 0.2% |
| movies-0020 | 1 | 0.2% |
| movies-0009 | 1 | 0.2% |
| movies-0002 | 1 | 0.2% |
| movies-0003 | 1 | 0.2% |
| movies-0004 | 1 | 0.2% |
| movies-0005 | 1 | 0.2% |
| movies-0006 | 1 | 0.2% |
| movies-0007 | 1 | 0.2% |
| movies-0008 | 1 | 0.2% |
| Other values (646) | 646 | 98.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 892 | 12.4% |
| m | 656 | 9.1% |
| v | 656 | 9.1% |
| i | 656 | 9.1% |
| e | 656 | 9.1% |
| s | 656 | 9.1% |
| - | 656 | 9.1% |
| o | 656 | 9.1% |
| 1 | 236 | 3.3% |
| 4 | 236 | 3.3% |
| Other values (7) | 1260 | 17.5% |
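The character counts are consistent with identifiers of the form `movies-NNNN`. The sketch below rebuilds them under the assumption that the identifiers run from `movies-0000` to `movies-0655` with no gaps (the report shows only the first five rows, so this range is inferred) and tallies the characters.

```python
from collections import Counter

# Assumption: identifiers are zero-padded and run from 0000 to 0655 without gaps.
ids = [f"movies-{i:04d}" for i in range(656)]

# Count every character across all identifiers.
char_counts = Counter("".join(ids))
total_chars = sum(char_counts.values())

print(total_chars, char_counts["0"], char_counts["1"], char_counts["4"])
```

With that assumption the totals match the table: 7216 characters overall, 892 occurrences of the digit 0, and 236 each of the digits 1 and 4.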
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7216 |
id
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 656 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 328.5 |
| Minimum | 1 |
| Maximum | 656 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 33.75 |
| Q1 | 164.75 |
| median | 328.5 |
| Q3 | 492.25 |
| 95-th percentile | 623.25 |
| Maximum | 656 |
| Range | 655 |
| Interquartile range (IQR) | 327.5 |
Descriptive statistics
| Standard deviation | 189.51517 |
|---|---|
| Coefficient of variation (CV) | 0.57691072 |
| Kurtosis | -1.2 |
| Mean | 328.5 |
| Median Absolute Deviation (MAD) | 164 |
| Skewness | 0 |
| Sum | 215496 |
| Variance | 35916 |
| Monotonicity | Strictly increasing |
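Because id takes every integer from 1 to 656 exactly once, its statistics can be recomputed from first principles. The sketch below assumes linearly interpolated percentiles and a sample variance with an n − 1 denominator; the `percentile` helper is written for this example and is not part of any library.

```python
import statistics

ids = list(range(1, 657))


def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1]."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(ids)
variance = statistics.variance(ids)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(ids)
median = statistics.median(ids)
q1 = percentile(ids, 0.25)
q3 = percentile(ids, 0.75)
iqr = q3 - q1
# Median absolute deviation around the median.
mad = statistics.median(abs(x - median) for x in ids)
total = sum(ids)
```

Under these assumptions the results agree with the tables above (mean 328.5, variance 35916, Q1 164.75, Q3 492.25, MAD 164, sum 215496).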
Common Values
| Value | Count | Frequency (%) |
| 1 | 1 | 0.2% |
| 432 | 1 | 0.2% |
| 434 | 1 | 0.2% |
| 435 | 1 | 0.2% |
| 436 | 1 | 0.2% |
| 437 | 1 | 0.2% |
| 438 | 1 | 0.2% |
| 439 | 1 | 0.2% |
| 440 | 1 | 0.2% |
| 441 | 1 | 0.2% |
| Other values (646) | 646 | 98.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 1 | 0.2% |
| 2 | 1 | 0.2% |
| 3 | 1 | 0.2% |
| 4 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 7 | 1 | 0.2% |
| 8 | 1 | 0.2% |
| 9 | 1 | 0.2% |
| 10 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 656 | 1 | 0.2% |
| 655 | 1 | 0.2% |
| 654 | 1 | 0.2% |
| 653 | 1 | 0.2% |
| 652 | 1 | 0.2% |
| 651 | 1 | 0.2% |
| 650 | 1 | 0.2% |
| 649 | 1 | 0.2% |
| 648 | 1 | 0.2% |
| 647 | 1 | 0.2% |
year
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.4 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 201 |
|---|---|
| 2nd row | 201 |
| 3rd row | 201 |
| 4th row | 201 |
| 5th row | 201 |
Common Values
| Value | Count | Frequency (%) |
| 200 | 376 | 57.3% |
| 201 | 280 | 42.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1032 | 52.4% |
| 2 | 656 | 33.3% |
| 1 | 280 | 14.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1968 |
exclude
Boolean
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 528 |
| Missing (%) | 80.5% |
| Memory size | 21.1 KiB |
| Value | Count | Frequency (%) |
| True | 128 | 19.5% |
| (Missing) | 528 | 80.5% |
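The 0.8% "Distinct (%)" figure looks small for a column with a single value. It is consistent with the percentage being taken over the 128 non-missing values rather than all 656 rows, although the report does not state this. The sketch below checks that reading and shows one possible cleanup, assuming (without confirmation from the source) that a missing flag means the row is not excluded.

```python
# Counts copied from the exclude table above: 128 True values, 528 missing.
values = [True] * 128 + [None] * 528

present = [v for v in values if v is not None]
missing_pct = round(100 * (len(values) - len(present)) / len(values), 1)

# Assumption: "Distinct (%)" is relative to the non-missing values only.
distinct = len(set(present))
distinct_pct = round(100 * distinct / len(present), 1)

# Hypothetical cleanup: treat a missing flag as "not excluded".
exclude_flag = [v is True for v in values]
```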
film
Text
| Distinct | 653 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Memory size | 41.3 KiB |
Length
| Max length | 56 |
|---|---|
| Median length | 40 |
| Mean length | 15.29313 |
| Min length | 1 |
Unique
| Unique | 651 |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | 127 Hours |
|---|---|
| 2nd row | A Nightmare on Elm Street |
| 3rd row | Alice in Wonderland |
| 4th row | All About Steve |
| 5th row | All Good Things |
Words
| Value | Count | Frequency (%) |
| the | 205 | 11.3% |
| of | 59 | 3.3% |
| and | 25 | 1.4% |
| a | 17 | 0.9% |
| in | 16 | 0.9% |
| 2 | 15 | 0.8% |
| to | 11 | 0.6% |
| love | 10 | 0.6% |
| you | 10 | 0.6% |
| i | 10 | 0.6% |
| Other values (1050) | 1435 | 79.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1157 | 11.6% |
| e | 1009 | 10.1% |
| a | 611 | 6.1% |
| r | 588 | 5.9% |
| o | 584 | 5.8% |
| n | 530 | 5.3% |
| t | 509 | 5.1% |
| i | 508 | 5.1% |
| s | 406 | 4.1% |
| h | 395 | 3.9% |
| Other values (65) | 3720 | 37.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10017 |
lead studio
Categorical
High correlation  Missing 
| Distinct | 46 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 109 |
| Missing (%) | 16.6% |
| Memory size | 37.5 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 9.8372943 |
| Min length | 3 |
Unique
| Unique | 20 |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | Independent |
|---|---|
| 2nd row | Warner Bros. |
| 3rd row | Disney |
| 4th row | Independent |
| 5th row | Independent |
Common Values
| Value | Count | Frequency (%) |
| Independent | 110 | 16.8% |
| Paramount | 55 | 8.4% |
| Warner Bros. | 53 | 8.1% |
| Universal | 46 | 7.0% |
| Sony | 42 | 6.4% |
| Fox | 41 | 6.2% |
| Disney | 35 | 5.3% |
| Independant | 32 | 4.9% |
| Lionsgate | 19 | 2.9% |
| Relativity Media | 15 | 2.3% |
| Other values (36) | 99 | 15.1% |
| (Missing) | 109 | 16.6% |
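The table lists both "Independent" (110) and "Independant" (32), which look like two spellings of the same category. The sketch below shows one way to merge such variants before analysis. The correction map and the `normalize_counts` helper are hypothetical and would need to be extended after reviewing all 46 distinct values.

```python
from collections import Counter

# Counts copied from the "Common Values" table above (selected entries only).
studio_counts = Counter({
    "Independent": 110,
    "Paramount": 55,
    "Warner Bros.": 53,
    "Independant": 32,
})

# Hypothetical correction map from variant spelling to canonical label.
SPELLING_FIXES = {"Independant": "Independent"}


def normalize_counts(counts, fixes):
    """Merge counts whose labels map to the same canonical spelling."""
    merged = Counter()
    for label, count in counts.items():
        merged[fixes.get(label, label)] += count
    return merged


cleaned = normalize_counts(studio_counts, SPELLING_FIXES)
print(cleaned["Independent"])
```

If the two spellings do refer to the same category, the merged group would hold 142 rows, making it considerably larger than the table suggests at first glance.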
Words
| Value | Count | Frequency (%) |
| independent | 111 | 15.7% |
| warner | 64 | 9.0% |
| bros | 64 | 9.0% |
| paramount | 55 | 7.8% |
| fox | 50 | 7.1% |
| universal | 46 | 6.5% |
| sony | 43 | 6.1% |
| disney | 35 | 4.9% |
| independant | 32 | 4.5% |
| relativity | 20 | 2.8% |
| Other values (46) | 189 | 26.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 798 | 14.8% |
| e | 704 | 13.1% |
| a | 368 | 6.8% |
| t | 348 | 6.5% |
| r | 346 | 6.4% |
| d | 312 | 5.8% |
| o | 275 | 5.1% |
| i | 251 | 4.7% |
| s | 215 | 4.0% |
| p | 172 | 3.2% |
| Other values (37) | 1592 | 29.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5381 |
rotten tomatoes
Real number (ℝ)
High correlation 
| Distinct | 100 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.99084 |
| Minimum | 0 |
| Maximum | 99 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 26 |
| median | 48 |
| Q3 | 72 |
| 95-th percentile | 92 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 26.659191 |
|---|---|
| Coefficient of variation (CV) | 0.54416685 |
| Kurtosis | -1.1594447 |
| Mean | 48.99084 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.084812409 |
| Sum | 32089 |
| Variance | 710.71245 |
| Monotonicity | Not monotonic |
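Several of the figures above are derived from others and can be cross-checked by hand. The sketch below recomputes the range, IQR, coefficient of variation, variance, and mean from the reported values, assuming the mean is taken over the 655 non-missing rows.

```python
# Figures copied from the rotten tomatoes tables above.
minimum, maximum = 0, 99
q1, q3 = 26, 72
mean = 48.99084
std_dev = 26.659191
total = 32089
non_missing = 656 - 1  # one missing value is reported

value_range = maximum - minimum
iqr = q3 - q1
cv = std_dev / mean            # coefficient of variation
variance = std_dev ** 2
recomputed_mean = round(total / non_missing, 5)
```

Each result agrees with the corresponding table entry to the precision shown in the report.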
Common Values
| Value | Count | Frequency (%) |
| 36 | 15 | 2.3% |
| 14 | 15 | 2.3% |
| 27 | 15 | 2.3% |
| 52 | 13 | 2.0% |
| 78 | 12 | 1.8% |
| 19 | 11 | 1.7% |
| 68 | 10 | 1.5% |
| 46 | 10 | 1.5% |
| 38 | 10 | 1.5% |
| 13 | 10 | 1.5% |
| Other values (90) | 534 | 81.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 1 | 1 | 0.2% |
| 2 | 3 | 0.5% |
| 3 | 2 | 0.3% |
| 4 | 5 | 0.8% |
| 5 | 2 | 0.3% |
| 6 | 5 | 0.8% |
| 7 | 6 | 0.9% |
| 8 | 4 | 0.6% |
| 9 | 8 | 1.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 99 | 1 | 0.2% |
| 98 | 2 | 0.3% |
| 97 | 3 | 0.5% |
| 96 | 4 | 0.6% |
| 95 | 3 | 0.5% |
| 94 | 9 | 1.4% |
| 93 | 9 | 1.4% |
| 92 | 6 | 0.9% |
| 91 | 5 | 0.8% |
| 90 | 5 | 0.8% |
audience score
Real number (ℝ)
High correlation 
| Distinct | 73 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.910061 |
| Minimum | 19 |
| Maximum | 96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 47 |
| median | 59 |
| Q3 | 73 |
| 95-th percentile | 87 |
| Maximum | 96 |
| Range | 77 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 16.579657 |
|---|---|
| Coefficient of variation (CV) | 0.27674245 |
| Kurtosis | -0.78875615 |
| Mean | 59.910061 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.015848479 |
| Sum | 39301 |
| Variance | 274.88503 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 72 | 21 | 3.2% |
| 50 | 20 | 3.0% |
| 56 | 19 | 2.9% |
| 73 | 19 | 2.9% |
| 57 | 19 | 2.9% |
| 61 | 16 | 2.4% |
| 47 | 16 | 2.4% |
| 55 | 15 | 2.3% |
| 43 | 15 | 2.3% |
| 48 | 15 | 2.3% |
| Other values (63) | 481 | 73.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 19 | 1 | 0.2% |
| 20 | 1 | 0.2% |
| 22 | 1 | 0.2% |
| 24 | 2 | 0.3% |
| 25 | 1 | 0.2% |
| 26 | 2 | 0.3% |
| 27 | 3 | 0.5% |
| 28 | 5 | 0.8% |
| 29 | 4 | 0.6% |
| 31 | 7 | 1.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 96 | 1 | 0.2% |
| 93 | 4 | 0.6% |
| 92 | 2 | 0.3% |
| 91 | 5 | 0.8% |
| 90 | 6 | 0.9% |
| 89 | 6 | 0.9% |
| 88 | 5 | 0.8% |
| 87 | 9 | 1.4% |
| 86 | 6 | 0.9% |
| 85 | 5 | 0.8% |
story
Categorical
High correlation 
| Distinct | 22 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 2 |
| Missing (%) | 0.3% |
| Memory size | 36.9 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 8.4266055 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Escape |
|---|---|
| 2nd row | Monster Force |
| 3rd row | Journey And Return |
| 4th row | Comedy |
| 5th row | The Riddle |
Common Values
| Value | Count | Frequency (%) |
| Comedy | 87 | 13.3% |
| Love | 73 | 11.1% |
| Monster Force | 63 | 9.6% |
| Quest | 59 | 9.0% |
| Rivalry | 38 | 5.8% |
| Discovery | 36 | 5.5% |
| Pursuit | 32 | 4.9% |
| Transformation | 27 | 4.1% |
| Revenge | 26 | 4.0% |
| Maturation | 26 | 4.0% |
| Other values (12) | 187 | 28.5% |
Words
| Value | Count | Frequency (%) |
| comedy | 87 | 10.3% |
| love | 73 | 8.7% |
| monster | 63 | 7.5% |
| force | 63 | 7.5% |
| quest | 59 | 7.0% |
| rivalry | 38 | 4.5% |
| discovery | 36 | 4.3% |
| pursuit | 32 | 3.8% |
| transformation | 27 | 3.2% |
| revenge | 26 | 3.1% |
| Other values (22) | 339 | 40.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 729 | 13.2% |
| o | 481 | 8.7% |
| r | 430 | 7.8% |
| t | 325 | 5.9% |
| s | 322 | 5.8% |
| n | 267 | 4.8% |
| i | 259 | 4.7% |
| a | 235 | 4.3% |
| u | 232 | 4.2% |
| d | 209 | 3.8% |
| Other values (27) | 2022 | 36.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5511 |
genre
Categorical
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.6 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.4314024 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adventure |
|---|---|
| 2nd row | Horror |
| 3rd row | Adventure |
| 4th row | Comedy |
| 5th row | Drama |
Common Values
| Value | Count | Frequency (%) |
| Comedy | 175 | 26.7% |
| Action | 159 | 24.2% |
| Drama | 98 | 14.9% |
| Horror | 50 | 7.6% |
| Animation | 49 | 7.5% |
| Thriller | 33 | 5.0% |
| Adventure | 31 | 4.7% |
| Romance | 22 | 3.4% |
| Crime | 16 | 2.4% |
| Biography | 13 | 2.0% |
| Other values (2) | 10 | 1.5% |
Words
| Value | Count | Frequency (%) |
| comedy | 175 | 26.7% |
| action | 159 | 24.2% |
| drama | 98 | 14.9% |
| horror | 50 | 7.6% |
| animation | 49 | 7.5% |
| thriller | 33 | 5.0% |
| adventure | 31 | 4.7% |
| romance | 22 | 3.4% |
| crime | 16 | 2.4% |
| biography | 13 | 2.0% |
| Other values (2) | 10 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 523 | 12.4% |
| r | 384 | 9.1% |
| m | 365 | 8.7% |
| i | 319 | 7.6% |
| e | 318 | 7.5% |
| n | 315 | 7.5% |
| a | 285 | 6.8% |
| t | 249 | 5.9% |
| A | 239 | 5.7% |
| d | 206 | 4.9% |
| Other values (16) | 1016 | 24.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4219 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 523 | |
| r | 384 | 9.1% |
| m | 365 | 8.7% |
| i | 319 | 7.6% |
| e | 318 | 7.5% |
| n | 315 | 7.5% |
| a | 285 | 6.8% |
| t | 249 | 5.9% |
| A | 239 | 5.7% |
| d | 206 | 4.9% |
| Other values (16) | 1016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4219 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 523 | |
| r | 384 | 9.1% |
| m | 365 | 8.7% |
| i | 319 | 7.6% |
| e | 318 | 7.5% |
| n | 315 | 7.5% |
| a | 285 | 6.8% |
| t | 249 | 5.9% |
| A | 239 | 5.7% |
| d | 206 | 4.9% |
| Other values (16) | 1016 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4219 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 523 | |
| r | 384 | 9.1% |
| m | 365 | 8.7% |
| i | 319 | 7.6% |
| e | 318 | 7.5% |
| n | 315 | 7.5% |
| a | 285 | 6.8% |
| t | 249 | 5.9% |
| A | 239 | 5.7% |
| d | 206 | 4.9% |
| Other values (16) | 1016 |
number of theatres in opening weekend
Real number (ℝ)
High correlation  Missing 
| Distinct | 532 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 45 |
| Missing (%) | 6.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2698.347 |
| Minimum | 2 |
| Maximum | 4468 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 631.5 |
| Q1 | 2396 |
| median | 2840 |
| Q3 | 3320 |
| 95-th percentile | 3962 |
| Maximum | 4468 |
| Range | 4466 |
| Interquartile range (IQR) | 924 |
Descriptive statistics
| Standard deviation | 952.1302 |
|---|---|
| Coefficient of variation (CV) | 0.35285684 |
| Kurtosis | 1.1428928 |
| Mean | 2698.347 |
| Median Absolute Deviation (MAD) | 465 |
| Skewness | -1.09158 |
| Sum | 1648690 |
| Variance | 906551.91 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 4 | 8 | 1.2% |
| 2470 | 3 | 0.5% |
| 3175 | 3 | 0.5% |
| 2511 | 3 | 0.5% |
| 3030 | 3 | 0.5% |
| 3606 | 3 | 0.5% |
| 3121 | 3 | 0.5% |
| 2534 | 3 | 0.5% |
| 2756 | 3 | 0.5% |
| 2707 | 2 | 0.3% |
| Other values (522) | 577 | 88.0% |
| (Missing) | 45 | 6.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 2 | 2 | 0.3% |
| 3 | 2 | 0.3% |
| 4 | 8 | 1.2% |
| 6 | 2 | 0.3% |
| 9 | 1 | 0.2% |
| 11 | 1 | 0.2% |
| 22 | 1 | 0.2% |
| 29 | 1 | 0.2% |
| 36 | 1 | 0.2% |
| 100 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 4468 | 1 | 0.2% |
| 4380 | 1 | 0.2% |
| 4375 | 1 | 0.2% |
| 4366 | 1 | 0.2% |
| 4362 | 1 | 0.2% |
| 4359 | 1 | 0.2% |
| 4325 | 1 | 0.2% |
| 4285 | 1 | 0.2% |
| 4260 | 1 | 0.2% |
| 4252 | 1 | 0.2% |
box office average per cinema
Real number (ℝ)
High correlation  Missing 
| Distinct | 579 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 54 |
| Missing (%) | 8.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8172.3322 |
| Minimum | 1052 |
| Maximum | 93230 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 1052 |
|---|---|
| 5-th percentile | 2243.8 |
| Q1 | 3785 |
| median | 5943 |
| Q3 | 9741 |
| 95-th percentile | 22282.2 |
| Maximum | 93230 |
| Range | 92178 |
| Interquartile range (IQR) | 5956 |
Descriptive statistics
| Standard deviation | 7865.2285 |
|---|---|
| Coefficient of variation (CV) | 0.96242153 |
| Kurtosis | 29.296118 |
| Mean | 8172.3322 |
| Median Absolute Deviation (MAD) | 2567.5 |
| Skewness | 4.1680711 |
| Sum | 4919744 |
| Variance | 61861820 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 4761 | 3 | 0.5% |
| 3132 | 2 | 0.3% |
| 4605 | 2 | 0.3% |
| 1806 | 2 | 0.3% |
| 9743 | 2 | 0.3% |
| 3610 | 2 | 0.3% |
| 4405 | 2 | 0.3% |
| 7655 | 2 | 0.3% |
| 5427 | 2 | 0.3% |
| 3300 | 2 | 0.3% |
| Other values (569) | 581 | 88.6% |
| (Missing) | 54 | 8.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1052 | 1 | 0.2% |
| 1354 | 1 | 0.2% |
| 1459 | 1 | 0.2% |
| 1490 | 1 | 0.2% |
| 1513 | 1 | 0.2% |
| 1559 | 1 | 0.2% |
| 1575 | 1 | 0.2% |
| 1585 | 1 | 0.2% |
| 1625 | 1 | 0.2% |
| 1703 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 93230 | 1 | 0.2% |
| 61777 | 1 | 0.2% |
| 45429 | 1 | 0.2% |
| 41890 | 1 | 0.2% |
| 41038 | 1 | 0.2% |
| 40385 | 1 | 0.2% |
| 39384 | 1 | 0.2% |
| 38672 | 1 | 0.2% |
| 36338 | 1 | 0.2% |
| 36283 | 1 | 0.2% |
domestic gross
Real number (ℝ)
High correlation 
| Distinct | 623 |
|---|---|
| Distinct (%) | 95.8% |
| Missing | 6 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.996292 |
| Minimum | 0 |
| Maximum | 743.8 |
| Zeros | 3 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.427 |
| Q1 | 20.22 |
| median | 40.195 |
| Q3 | 88.8675 |
| 95-th percentile | 222.061 |
| Maximum | 743.8 |
| Range | 743.8 |
| Interquartile range (IQR) | 68.6475 |
Descriptive statistics
| Standard deviation | 77.446419 |
|---|---|
| Coefficient of variation (CV) | 1.13898 |
| Kurtosis | 13.121705 |
| Mean | 67.996292 |
| Median Absolute Deviation (MAD) | 26.585 |
| Skewness | 2.8472878 |
| Sum | 44197.59 |
| Variance | 5997.9478 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 31.7 | 3 | 0.5% |
| 0 | 3 | 0.5% |
| 25.93 | 2 | 0.3% |
| 22.5 | 2 | 0.3% |
| 24.54 | 2 | 0.3% |
| 16 | 2 | 0.3% |
| 9.2 | 2 | 0.3% |
| 10.3 | 2 | 0.3% |
| 0.02 | 2 | 0.3% |
| 0.54 | 2 | 0.3% |
| Other values (613) | 628 | 95.7% |
| (Missing) | 6 | 0.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 3 | 0.5% |
| 0.02 | 2 | 0.3% |
| 0.03 | 2 | 0.3% |
| 0.11 | 1 | 0.2% |
| 0.22 | 1 | 0.2% |
| 0.38 | 1 | 0.2% |
| 0.41 | 1 | 0.2% |
| 0.54 | 2 | 0.3% |
| 0.58 | 1 | 0.2% |
| 0.97 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 743.8 | 1 | 0.2% |
| 530.92 | 1 | 0.2% |
| 415 | 1 | 0.2% |
| 402.1 | 1 | 0.2% |
| 381.01 | 1 | 0.2% |
| 352.39 | 1 | 0.2% |
| 336.53 | 1 | 0.2% |
| 334.19 | 1 | 0.2% |
| 322.72 | 1 | 0.2% |
| 319.25 | 1 | 0.2% |
foreign gross
Real number (ℝ)
High correlation  Missing 
| Distinct | 577 |
|---|---|
| Distinct (%) | 96.0% |
| Missing | 55 |
| Missing (%) | 8.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.978687 |
| Minimum | 0 |
| Maximum | 1969 |
| Zeros | 6 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.58 |
| Q1 | 11.22 |
| median | 37.84 |
| Q3 | 93.74 |
| 95-th percentile | 389.03 |
| Maximum | 1969 |
| Range | 1969 |
| Interquartile range (IQR) | 82.52 |
Descriptive statistics
| Standard deviation | 151.33161 |
|---|---|
| Coefficient of variation (CV) | 1.720094 |
| Kurtosis | 44.468603 |
| Mean | 87.978687 |
| Median Absolute Deviation (MAD) | 32.45 |
| Skewness | 5.0723883 |
| Sum | 52875.191 |
| Variance | 22901.256 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6 | 0.9% |
| 2.5 | 2 | 0.3% |
| 16.8 | 2 | 0.3% |
| 20.8 | 2 | 0.3% |
| 17 | 2 | 0.3% |
| 5.5 | 2 | 0.3% |
| 6.7 | 2 | 0.3% |
| 5.8 | 2 | 0.3% |
| 3.1 | 2 | 0.3% |
| 1.11 | 2 | 0.3% |
| Other values (567) | 577 | 88.0% |
| (Missing) | 55 | 8.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 6 | 0.9% |
| 0.01 | 2 | 0.3% |
| 0.02 | 1 | 0.2% |
| 0.029 | 1 | 0.2% |
| 0.062 | 1 | 0.2% |
| 0.12 | 1 | 0.2% |
| 0.14 | 2 | 0.3% |
| 0.15 | 1 | 0.2% |
| 0.17 | 1 | 0.2% |
| 0.23 | 2 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 1969 | 1 | 0.2% |
| 947.1 | 1 | 0.2% |
| 802.8 | 1 | 0.2% |
| 770.81 | 1 | 0.2% |
| 690.2 | 1 | 0.2% |
| 687.9 | 1 | 0.2% |
| 660 | 1 | 0.2% |
| 651.58 | 1 | 0.2% |
| 648.16 | 1 | 0.2% |
| 647.88 | 1 | 0.2% |
worldwide gross
Real number (ℝ)
High correlation 
| Distinct | 634 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 4 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 151.07402 |
| Minimum | 0.03 |
| Maximum | 2712.85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0.03 |
|---|---|
| 5-th percentile | 8.6585 |
| Q1 | 34.815 |
| median | 75.7 |
| Q3 | 176.1825 |
| 95-th percentile | 576.367 |
| Maximum | 2712.85 |
| Range | 2712.82 |
| Interquartile range (IQR) | 141.3675 |
Descriptive statistics
| Standard deviation | 216.96465 |
|---|---|
| Coefficient of variation (CV) | 1.436148 |
| Kurtosis | 33.960663 |
| Mean | 151.07402 |
| Median Absolute Deviation (MAD) | 53.495 |
| Skewness | 4.3536168 |
| Sum | 98500.26 |
| Variance | 47073.658 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0.03 | 2 | 0.3% |
| 93.4 | 2 | 0.3% |
| 63.8 | 2 | 0.3% |
| 73.8 | 2 | 0.3% |
| 183.3 | 2 | 0.3% |
| 1.1 | 2 | 0.3% |
| 29 | 2 | 0.3% |
| 73.4 | 2 | 0.3% |
| 29.37 | 2 | 0.3% |
| 34.5 | 2 | 0.3% |
| Other values (624) | 632 | 96.3% |
| (Missing) | 4 | 0.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.03 | 2 | 0.3% |
| 0.38 | 1 | 0.2% |
| 0.41 | 1 | 0.2% |
| 0.55 | 1 | 0.2% |
| 0.64 | 1 | 0.2% |
| 1.07 | 1 | 0.2% |
| 1.1 | 2 | 0.3% |
| 1.32 | 1 | 0.2% |
| 1.57 | 1 | 0.2% |
| 2.71 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2712.85 | 1 | 0.2% |
| 1328.11 | 1 | 0.2% |
| 1123.2 | 1 | 0.2% |
| 1063.16 | 1 | 0.2% |
| 1043.87 | 1 | 0.2% |
| 1024.39 | 1 | 0.2% |
| 996.9 | 1 | 0.2% |
| 961 | 1 | 0.2% |
| 955.41 | 1 | 0.2% |
| 939.88 | 1 | 0.2% |
budget
Real number (ℝ)
High correlation  Missing 
| Distinct | 135 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 11 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.096147 |
| Minimum | 0 |
| Maximum | 300 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 20 |
| median | 35 |
| Q3 | 70 |
| 95-th percentile | 166.6 |
| Maximum | 300 |
| Range | 300 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 51.370097 |
|---|---|
| Coefficient of variation (CV) | 0.94960732 |
| Kurtosis | 2.7571938 |
| Mean | 54.096147 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 1.706458 |
| Sum | 34892.015 |
| Variance | 2638.8869 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 20 | 49 | 7.5% |
| 30 | 33 | 5.0% |
| 25 | 29 | 4.4% |
| 40 | 28 | 4.3% |
| 35 | 25 | 3.8% |
| 15 | 24 | 3.7% |
| 60 | 21 | 3.2% |
| 150 | 20 | 3.0% |
| 50 | 19 | 2.9% |
| 80 | 18 | 2.7% |
| Other values (125) | 379 | 57.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 0.015 | 1 | 0.2% |
| 0.2 | 1 | 0.2% |
| 0.5 | 1 | 0.2% |
| 1.5 | 1 | 0.2% |
| 1.7 | 1 | 0.2% |
| 1.8 | 1 | 0.2% |
| 2 | 3 | 0.5% |
| 2.5 | 1 | 0.2% |
| 2.6 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 300 | 1 | 0.2% |
| 260 | 1 | 0.2% |
| 258 | 1 | 0.2% |
| 250 | 2 | 0.3% |
| 237 | 1 | 0.2% |
| 230 | 1 | 0.2% |
| 210 | 1 | 0.2% |
| 200 | 11 | 1.7% |
| 195 | 1 | 0.2% |
| 190 | 1 | 0.2% |
profit
Real number (ℝ)
High correlation 
| Distinct | 637 |
|---|---|
| Distinct (%) | 97.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.963788 |
| Minimum | -111.01 |
| Maximum | 2475.85 |
| Zeros | 4 |
| Zeros (%) | 0.6% |
| Negative | 107 |
| Negative (%) | 16.3% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | -111.01 |
|---|---|
| 5-th percentile | -18.1175 |
| Q1 | 6.55 |
| median | 39.925 |
| Q3 | 117.9225 |
| 95-th percentile | 438.6375 |
| Maximum | 2475.85 |
| Range | 2586.86 |
| Interquartile range (IQR) | 111.3725 |
Descriptive statistics
| Standard deviation | 182.58967 |
|---|---|
| Coefficient of variation (CV) | 1.8830708 |
| Kurtosis | 48.681841 |
| Mean | 96.963788 |
| Median Absolute Deviation (MAD) | 39.27 |
| Skewness | 5.2178514 |
| Sum | 63608.245 |
| Variance | 33338.989 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 13.5 | 4 | 0.6% |
| 0 | 4 | 0.6% |
| -6.1 | 2 | 0.3% |
| 38.3 | 2 | 0.3% |
| 14 | 2 | 0.3% |
| 11.2 | 2 | 0.3% |
| 3.3 | 2 | 0.3% |
| 1.12 | 2 | 0.3% |
| 26.8 | 2 | 0.3% |
| -2.9 | 2 | 0.3% |
| Other values (627) | 632 | 96.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -111.01 | 1 | 0.2% |
| -88.5 | 1 | 0.2% |
| -58.8 | 1 | 0.2% |
| -50.5 | 1 | 0.2% |
| -50 | 1 | 0.2% |
| -44.83 | 1 | 0.2% |
| -41.2 | 1 | 0.2% |
| -39.83 | 1 | 0.2% |
| -37.66 | 1 | 0.2% |
| -36.1 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2475.85 | 1 | 0.2% |
| 1203.11 | 1 | 0.2% |
| 928.2 | 1 | 0.2% |
| 863.16 | 1 | 0.2% |
| 830.41 | 1 | 0.2% |
| 824.39 | 1 | 0.2% |
| 811.9 | 1 | 0.2% |
| 794.5 | 1 | 0.2% |
| 793.87 | 1 | 0.2% |
| 789.88 | 1 | 0.2% |
proftitability
Text
Missing 
| Distinct | 583 |
|---|---|
| Distinct (%) | 90.4% |
| Missing | 11 |
| Missing (%) | 1.7% |
| Memory size | 34.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.5643411 |
| Min length | 1 |
Unique
| Unique | 534 |
|---|---|
| Unique (%) | 82.8% |
Sample
| 1st row | 337.39% |
|---|---|
| 2nd row | 330.46% |
| 3rd row | 512.20% |
| 4th row | 267.53% |
| 5th row | 3.20% |
| Value | Count | Frequency (%) |
| 1.93 | 5 | 0.8% |
| 2.1 | 4 | 0.6% |
| 0.9 | 3 | 0.5% |
| 6.83 | 3 | 0.5% |
| 1.47 | 3 | 0.5% |
| 1.3 | 3 | 0.5% |
| 1.07 | 3 | 0.5% |
| 2.09 | 3 | 0.5% |
| 2.03 | 3 | 0.5% |
| 1.73 | 3 | 0.5% |
| Other values (572) | 612 | 94.9% |
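The sample rows for proftitability are percentage strings such as 337.39%, while the most frequent values are bare numbers such as 1.93, so the column appears to mix two encodings. This would also explain why it is typed as Text. In the sample rows at the end of this report, 337.39% agrees with worldwide gross / budget (60.73 / 18.0) expressed as a percentage, and 4.15 agrees with 727.1 / 175.0 as a plain ratio. The sketch below assumes that reading. The helper name `profitability_to_ratio` is hypothetical and not part of the dataset or the profiler.

```python
import math


def profitability_to_ratio(value):
    """Convert a raw proftitability cell to a plain ratio, or None if missing.

    Assumption: strings ending in '%' are percentages of the budget,
    and bare numbers are already ratios.
    """
    if value is None:
        return None
    if isinstance(value, float) and math.isnan(value):
        return None
    text = str(value).strip()
    if not text:
        return None
    if text.endswith("%"):
        return float(text[:-1]) / 100.0
    return float(text)


# Values taken from the Sample tables of this report.
samples = ["337.39%", "3.20%", "4.15", "1", None]
ratios = [profitability_to_ratio(v) for v in samples]
print(ratios)
```

After such a conversion the column could be profiled as a numeric variable rather than as text.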
Most occurring characters
| Value | Count | Frequency (%) |
| . | 642 | 17.9% |
| 1 | 374 | 10.4% |
| % | 360 | 10.0% |
| 2 | 338 | 9.4% |
| 3 | 313 | 8.7% |
| 0 | 269 | 7.5% |
| 4 | 233 | 6.5% |
| 5 | 223 | 6.2% |
| 8 | 217 | 6.0% |
| 7 | 214 | 6.0% |
| Other values (10) | 406 | 11.3% |
opening weekend
Real number (ℝ)
High correlation 
| Distinct | 483 |
|---|---|
| Distinct (%) | 74.3% |
| Missing | 6 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.275206 |
| Minimum | 0 |
| Maximum | 169.19 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.3315 |
| Q1 | 7 |
| median | 13.93 |
| Q3 | 26.625 |
| 95-th percentile | 65.951 |
| Maximum | 169.19 |
| Range | 169.19 |
| Interquartile range (IQR) | 19.625 |
Descriptive statistics
| Standard deviation | 23.515448 |
|---|---|
| Coefficient of variation (CV) | 1.1052982 |
| Kurtosis | 9.7180121 |
| Mean | 21.275206 |
| Median Absolute Deviation (MAD) | 8.53 |
| Skewness | 2.7053204 |
| Sum | 13828.884 |
| Variance | 552.97628 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 5 | 0.8% |
| 17.6 | 5 | 0.8% |
| 6.9 | 5 | 0.8% |
| 5.4 | 4 | 0.6% |
| 4.7 | 4 | 0.6% |
| 10.6 | 4 | 0.6% |
| 0.11 | 4 | 0.6% |
| 7 | 4 | 0.6% |
| 7.6 | 4 | 0.6% |
| 12.3 | 4 | 0.6% |
| Other values (473) | 607 | 92.5% |
| (Missing) | 6 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 0.01 | 1 | 0.2% |
| 0.02 | 2 | 0.3% |
| 0.032 | 1 | 0.2% |
| 0.037 | 1 | 0.2% |
| 0.05 | 2 | 0.3% |
| 0.075 | 1 | 0.2% |
| 0.08 | 1 | 0.2% |
| 0.09 | 2 | 0.3% |
| 0.097 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 169.19 | 1 | 0.2% |
| 158.4 | 1 | 0.2% |
| 151.1 | 1 | 0.2% |
| 142.8 | 1 | 0.2% |
| 138.12 | 1 | 0.2% |
| 128.1 | 1 | 0.2% |
| 125 | 1 | 0.2% |
| 121.6 | 1 | 0.2% |
| 116.1 | 1 | 0.2% |
| 114.7 | 1 | 0.2% |
oscar
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 640 |
| Missing (%) | 97.6% |
| Memory size | 21.3 KiB |
Length
| Max length | 60 |
|---|---|
| Median length | 36.5 |
| Mean length | 23.375 |
| Min length | 8 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | 56.2% |
Sample
| 1st row | Best Actress |
|---|---|
| 2nd row | Sup. Actor, Sup. Actress |
| 3rd row | Best Picture, Director, Actor, Orig. Screenplay |
| 4th row | Original Screenplay |
| 5th row | Best Picture, Director, Supporting Actor, Adapted Screenplay |
| Value | Count | Frequency (%) |
| best | 8 | 16.7% |
| actor | 7 | 14.6% |
| screenplay | 6 | 12.5% |
| actress | 4 | 8.3% |
| supporting | 4 | 8.3% |
| picture | 4 | 8.3% |
| director | 4 | 8.3% |
| animated | 3 | 6.2% |
| original | 3 | 6.2% |
| sup | 2 | 4.2% |
| Other values (2) | 3 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 37 | 9.9% |
| e | 37 | 9.9% |
| t | 36 | 9.6% |
| (space) | 32 | 8.6% |
| c | 25 | 6.7% |
| i | 22 | 5.9% |
| p | 18 | 4.8% |
| s | 16 | 4.3% |
| n | 16 | 4.3% |
| A | 16 | 4.3% |
| Other values (15) | 119 | 31.8% |
bafta
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 644 |
| Missing (%) | 98.2% |
| Memory size | 21.0 KiB |
Length
| Max length | 39 |
|---|---|
| Median length | 26 |
| Mean length | 18.666667 |
| Min length | 8 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | 58.3% |
Sample
| 1st row | Original Screenplay |
|---|---|
| 2nd row | Supporting Actor, Director |
| 3rd row | Animated |
| 4th row | Supporting Actress |
| 5th row | Leading Actor |
| Value | Count | Frequency (%) |
| supporting | 4 | 14.8% |
| actor | 4 | 14.8% |
| screenplay | 4 | 14.8% |
| animated | 3 | 11.1% |
| director | 3 | 11.1% |
| original | 2 | 7.4% |
| film | 2 | 7.4% |
| adapted | 2 | 7.4% |
| actress | 1 | 3.7% |
| leading | 1 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 21 | 9.4% |
| e | 19 | 8.5% |
| t | 18 | 8.0% |
| i | 17 | 7.6% |
| (space) | 15 | 6.7% |
| p | 14 | 6.2% |
| n | 14 | 6.2% |
| c | 12 | 5.4% |
| a | 12 | 5.4% |
| o | 11 | 4.9% |
| Other values (15) | 71 | 31.7% |
source
Text
Missing 
| Distinct | 223 |
|---|---|
| Distinct (%) | 73.1% |
| Missing | 351 |
| Missing (%) | 53.5% |
| Memory size | 41.3 KiB |
Length
| Max length | 140 |
|---|---|
| Median length | 107 |
| Mean length | 52.363934 |
| Min length | 24 |
Unique
| Unique | 218 |
|---|---|
| Unique (%) | 71.5% |
Sample
| 1st row | http://boxofficemojo.com/movies/?id=127hours.htm |
|---|---|
| 2nd row | http://www.the-numbers.com/movies/2009/ABSTV.php |
| 3rd row | http://www.wikipedia.org |
| 4th row | http://boxofficemojo.com/movies |
| 5th row | http://boxofficemojo.com/movies |
| Value | Count | Frequency (%) |
| http://www.the-numbers.com/movies/records/allbudgets.php | 66 | 20.3% |
| http://boxofficemojo.com/movies | 13 | 4.0% |
| http://www.the-numbers.com/movies/2009/abstv.php | 4 | 1.2% |
| http://latimesblogs.latimes.com/entertainmentnewsbuzz/2011/10/movie-projector-real-steel-ides-of-march.html | 2 | 0.6% |
| http://latimesblogs.latimes.com/entertainmentnewsbuzz/2011/11/muppets-arthur-christmas-hugo-box-office.html | 2 | 0.6% |
| | 2 | 0.6% |
| unofficial | 1 | 0.3% |
| for | 1 | 0.3% |
| links | 1 | 0.3% |
| see | 1 | 0.3% |
| Other values (232) | 232 | 71.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1510 | 9.5% |
| / | 1327 | 8.3% |
| t | 1149 | 7.2% |
| e | 1118 | 7.0% |
| m | 1111 | 7.0% |
| i | 830 | 5.2% |
| h | 790 | 4.9% |
| . | 720 | 4.5% |
| s | 699 | 4.4% |
| p | 602 | 3.8% |
| Other values (66) | 6115 | 38.3% |
column
URL
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 655 |
| Missing (%) | 99.8% |
| Memory size | 20.7 KiB |
Full
| Value | Count | Frequency (%) |
| http://www.shadowandact.com/?p=23430 | 1 | 0.2% |
| (Missing) | 655 | 99.8% |
Scheme
| Value | Count | Frequency (%) |
| http | 1 | 0.2% |
| (Missing) | 655 | 99.8% |
Netloc
| Value | Count | Frequency (%) |
| www.shadowandact.com | 1 | 0.2% |
| (Missing) | 655 | 99.8% |
Path
| Value | Count | Frequency (%) |
| / | 1 | 0.2% |
| (Missing) | 655 | 99.8% |
Query
| Value | Count | Frequency (%) |
| p=23430 | 1 | 0.2% |
| (Missing) | 655 | 99.8% |
Fragment
| Value | Count | Frequency (%) |
| (empty) | 1 | 0.2% |
| (Missing) | 655 | 99.8% |
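The component tables above (http, www.shadowandact.com, /, p=23430, and an empty last component) correspond to the usual split of a URL into scheme, network location, path, query, and fragment. As an illustration only, the same breakdown can be reproduced with Python's standard `urllib.parse.urlsplit`. Whether the profiler uses this exact function internally is an assumption.

```python
from urllib.parse import urlsplit

# The single non-missing value of the "column" variable in this report.
url = "http://www.shadowandact.com/?p=23430"

parts = urlsplit(url)

# Print each component next to its name.
for name in ("scheme", "netloc", "path", "query", "fragment"):
    print(f"{name}: {getattr(parts, name)!r}")
```

Since the variable holds one value in 656 rows, it carries no information for analysis and is a candidate for dropping.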
Interactions
Correlations
| | audience score | box office average per cinema | budget | domestic gross | foreign gross | genre | id | lead studio | number of theatres in opening weekend | opening weekend | profit | rotten tomatoes | story | worldwide gross | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| audience score | 1.000 | 0.435 | 0.162 | 0.407 | 0.359 | 0.000 | -0.018 | 0.000 | 0.160 | 0.270 | 0.397 | 0.670 | 0.125 | 0.369 | 0.000 |
| box office average per cinema | 0.435 | 1.000 | 0.395 | 0.766 | 0.607 | 0.108 | -0.041 | 0.000 | 0.472 | 0.776 | 0.698 | 0.282 | 0.067 | 0.703 | 0.000 |
| budget | 0.162 | 0.395 | 1.000 | 0.650 | 0.697 | 0.149 | -0.018 | 0.146 | 0.721 | 0.644 | 0.422 | -0.003 | 0.128 | 0.709 | 0.000 |
| domestic gross | 0.407 | 0.766 | 0.650 | 1.000 | 0.802 | 0.052 | -0.036 | 0.000 | 0.765 | 0.925 | 0.866 | 0.160 | 0.086 | 0.938 | 0.147 |
| foreign gross | 0.359 | 0.607 | 0.697 | 0.802 | 1.000 | 0.000 | -0.074 | 0.219 | 0.741 | 0.771 | 0.846 | 0.130 | 0.089 | 0.941 | 0.128 |
| genre | 0.000 | 0.108 | 0.149 | 0.052 | 0.000 | 1.000 | 1.000 | 0.057 | 0.191 | 0.120 | 0.135 | 0.091 | 0.350 | 0.000 | 0.000 |
| id | -0.018 | -0.041 | -0.018 | -0.036 | -0.074 | 1.000 | 1.000 | 1.000 | -0.064 | -0.021 | -0.061 | -0.064 | 1.000 | -0.054 | 1.000 |
| lead studio | 0.000 | 0.000 | 0.146 | 0.000 | 0.219 | 0.057 | 1.000 | 1.000 | 0.111 | 0.000 | 0.212 | 0.054 | 0.086 | 0.106 | 0.404 |
| number of theatres in opening weekend | 0.160 | 0.472 | 0.721 | 0.765 | 0.741 | 0.191 | -0.064 | 0.111 | 1.000 | 0.823 | 0.635 | -0.023 | 0.124 | 0.779 | 0.190 |
| opening weekend | 0.270 | 0.776 | 0.644 | 0.925 | 0.771 | 0.120 | -0.021 | 0.000 | 0.823 | 1.000 | 0.799 | 0.033 | 0.087 | 0.886 | 0.000 |
| profit | 0.397 | 0.698 | 0.422 | 0.866 | 0.846 | 0.135 | -0.061 | 0.212 | 0.635 | 0.799 | 1.000 | 0.179 | 0.116 | 0.910 | 0.000 |
| rotten tomatoes | 0.670 | 0.282 | -0.003 | 0.160 | 0.130 | 0.091 | -0.064 | 0.054 | -0.023 | 0.033 | 0.179 | 1.000 | 0.061 | 0.130 | 0.000 |
| story | 0.125 | 0.067 | 0.128 | 0.086 | 0.089 | 0.350 | 1.000 | 0.086 | 0.124 | 0.087 | 0.116 | 0.061 | 1.000 | 0.021 | 0.124 |
| worldwide gross | 0.369 | 0.703 | 0.709 | 0.938 | 0.941 | 0.000 | -0.054 | 0.106 | 0.779 | 0.886 | 0.910 | 0.130 | 0.021 | 1.000 | 0.109 |
| year | 0.000 | 0.000 | 0.000 | 0.147 | 0.128 | 0.000 | 1.000 | 0.404 | 0.190 | 0.000 | 0.000 | 0.000 | 0.124 | 0.109 | 1.000 |
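The "High correlation" alerts at the top of this report can be read off the matrix above. The sketch below lists the pairs whose absolute coefficient reaches a threshold, using a few cells copied from the matrix. The cut-off of 0.5 is an assumption: it is consistent with the alerts shown (for example, audience score is flagged against rotten tomatoes at 0.670 but not against box office average per cinema at 0.435), but the report does not state it. The function name is hypothetical.

```python
THRESHOLD = 0.5  # assumed cut-off; not stated in the report

# A few off-diagonal cells copied from the correlation matrix above.
pairs = {
    ("audience score", "rotten tomatoes"): 0.670,
    ("audience score", "budget"): 0.162,
    ("audience score", "domestic gross"): 0.407,
    ("budget", "domestic gross"): 0.650,
    ("budget", "rotten tomatoes"): -0.003,
    ("domestic gross", "rotten tomatoes"): 0.160,
}


def high_correlation_pairs(pairs, threshold=THRESHOLD):
    """Return the variable pairs whose |coefficient| is at or above threshold."""
    return sorted(pair for pair, value in pairs.items() if abs(value) >= threshold)


flagged = high_correlation_pairs(pairs)
for left, right in flagged:
    print(f"{left} <-> {right}")
```

The coefficients of exactly 1.000 between id and genre, lead studio, story, and year are most likely an artifact of id being a unique row identifier, not a real relationship.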
Missing values
Sample
First rows
| | movies_id | id | year | exclude | film | lead studio | rotten tomatoes | audience score | story | genre | number of theatres in opening weekend | box office average per cinema | domestic gross | foreign gross | worldwide gross | budget | profit | proftitability | opening weekend | oscar | bafta | source | column |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | movies-0000 | 1 | 201 | None | 127 Hours | Independent | 93.0 | 84 | Escape | Adventure | 916.0 | 2333.0 | 18.33 | 42.4 | 60.73 | 18.0 | 42.73 | 337.39% | 0.260 | None | None | http://boxofficemojo.com/movies/?id=127hours.htm | None |
| 1 | movies-0001 | 2 | 201 | None | A Nightmare on Elm Street | Warner Bros. | 13.0 | 40 | Monster Force | Horror | 3332.0 | 9875.0 | 63.08 | 52.59 | 115.66 | 35.0 | 80.66 | 330.46% | 32.900 | None | None | None | None |
| 2 | movies-0002 | 3 | 201 | None | Alice in Wonderland | Disney | 52.0 | 72 | Journey And Return | Adventure | 3728.0 | 31143.0 | 334.19 | 690.2 | 1024.39 | 200.0 | 824.39 | 512.20% | 116.100 | None | None | None | None |
| 3 | movies-0003 | 4 | 201 | None | All About Steve | Independent | 6.0 | 35 | Comedy | Comedy | 2251.0 | 4994.0 | 33.86 | 6.26 | 40.13 | 15.0 | 25.13 | 267.53% | 11.200 | None | None | http://www.the-numbers.com/movies/2009/ABSTV.php | None |
| 4 | movies-0004 | 5 | 201 | True | All Good Things | Independent | 33.0 | 64 | The Riddle | Drama | 2.0 | NaN | 0.58 | 0.062 | 0.64 | 20.0 | -19.36 | 3.20% | 0.037 | None | None | http://www.wikipedia.org | None |
| 5 | movies-0005 | 6 | 201 | None | Alpha and Omega | Crest | 17.0 | 41 | Journey And Return | Animation | 2625.0 | 3469.0 | 25.12 | 4.8 | 29.91 | 20.0 | 9.91 | 149.55% | 9.100 | None | None | None | None |
| 6 | movies-0006 | 7 | 201 | True | Barry Munday | Independent | 43.0 | 43 | Maturation | Comedy | NaN | NaN | None | None | None | NaN | 0.00 | None | NaN | None | None | None | None |
| 7 | movies-0007 | 8 | 201 | None | Black Swan | Fox | 88.0 | 86 | Wretched Excess | Drama | 959.0 | 8742.0 | 106.95 | 222.44 | 329.39 | 13.0 | 316.39 | 2533.77% | 8.380 | Best Actress | None | None | None |
| 8 | movies-0008 | 9 | 201 | None | Brooklyn's Finest | Independent | 42.0 | 47 | Temptation | Action | 1936.0 | 6896.0 | 27.2 | 9.15 | 36.31 | 17.0 | 19.31 | 213.59% | 13.400 | None | None | http://boxofficemojo.com/movies | None |
| 9 | movies-0009 | 10 | 201 | None | Buried | Independent | 86.0 | 63 | Escape | Drama | 11.0 | 9115.0 | 1.04 | 17.33 | 18.38 | 2.0 | 16.38 | 919.00% | 0.103 | None | None | http://boxofficemojo.com/movies | None |
Last rows
| | movies_id | id | year | exclude | film | lead studio | rotten tomatoes | audience score | story | genre | number of theatres in opening weekend | box office average per cinema | domestic gross | foreign gross | worldwide gross | budget | profit | proftitability | opening weekend | oscar | bafta | source | column |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 646 | movies-0646 | 647 | 200 | None | Up | Disney | 98.0 | 86 | Journey And Return | Animation | 3766.0 | 18085.0 | 293.0 | 434 | 727.1 | 175.0 | 552.10 | 4.15 | 68.10 | Animated | Animated | http://www.the-numbers.com/movies/2009/UP.php | None |
| 647 | movies-0647 | 648 | 200 | None | Up in the Air | Paramount | 90.0 | 76 | Maturation | Drama | 1895.0 | 5947.0 | 83.82 | 78.2 | 162.02 | 25.0 | 137.02 | 6.48 | 11.20 | None | Adapted Screenplay | http://boxofficemojo.com/movies/?id=upintheair.htm | None |
| 648 | movies-0648 | 649 | 200 | None | Watchmen | Warner Bros. | 64.0 | 68 | Sacrifice | Action | 3611.0 | 15291.0 | 107.5 | 77.7 | 185.3 | 138.0 | 47.30 | 1.34 | 55.20 | None | None | http://www.the-numbers.com/movies/records/allbudgets.php | None |
| 649 | movies-0649 | 650 | 200 | True | Whatever Works | None | 50.0 | 63 | Discovery | Comedy | 9.0 | 29574.0 | 5.3 | 23.7 | 29.0 | 15.0 | 14.00 | 1.93 | 0.26 | None | None | http://en.wikipedia.org/wiki/Whatever_Works | None |
| 650 | movies-0650 | 651 | 200 | None | Where the Wild Things Are | Warner Bros. | 73.0 | 59 | Journey And Return | Adventure | 3735.0 | 8754.0 | 63.4 | 16.3 | 85.3 | 100.0 | -14.70 | 0.85 | 32.70 | None | None | http://www.the-numbers.com/movies/records/allbudgets.php | None |
| 651 | movies-0651 | 652 | 200 | True | Whip It | None | 84.0 | 73 | Maturation | Drama | 1721.0 | 2702.0 | 13.0 | 3 | 16.0 | 15.0 | 1.00 | 1.07 | 4.70 | None | None | http://www.boxofficemojo.com/movies/?id=whipit.htm | None |
| 652 | movies-0652 | 653 | 200 | True | Whiteout | None | 7.0 | 28 | Pursuit | Action | 2745.0 | 1791.0 | 10.3 | 1.9 | 12.2 | 35.0 | -22.80 | 0.35 | 4.90 | None | None | http://www.the-numbers.com/movies/records/allbudgets.php | None |
| 653 | movies-0653 | 654 | 200 | None | X-Men Origins: Wolverine | Fox | 37.0 | 72 | Revenge | Action | 4099.0 | 20751.0 | 179.9 | 193.2 | 373.1 | 150.0 | 223.10 | 2.49 | 85.10 | None | None | http://www.the-numbers.com/movies/records/allbudgets.php | None |
| 654 | movies-0654 | 655 | 200 | True | Year One | None | 14.0 | 31 | Quest | Adventure | 3022.0 | 6489.0 | 32.4 | 26.2 | 60.2 | 60.0 | 0.20 | 1 | 19.60 | None | None | http://www.the-numbers.com/movies/records/allbudgets.php | None |
| 655 | movies-0655 | 656 | 200 | None | Zombieland | Sony | 90.0 | 87 | Monster Force | Action | 3036.0 | 8147.0 | 49.2 | 42.5 | 93.3 | 23.6 | 69.70 | 3.95 | 24.70 | None | None | http://www.the-numbers.com/movies/records/allbudgets.php | None |